A Case Frame Learning Method for Japanese Polysemous Verbs
Abstract
This paper presents a new method for learning case frames of Japanese polysemous verbs from a roughly parsed corpus, given a semantic hierarchy for nouns (a thesaurus). Japanese verbs usually have several meanings, each taking a different case frame. Each frame contains different types and numbers of case particles (case markers), which in turn select different noun categories. The proposed method employs a bottom-up covering technique to avoid the combinatorial explosion arising from more than ten case particles in Japanese and more than 3000 semantic categories in our thesaurus. First, a sequence of case frame candidates is produced by generalizing training instances using the thesaurus. Then, to select the most plausible frame, we introduce a new compression-based utility criterion that can uniformly compare candidates with different structures. Finally, we remove the instances covered by the selected frame and iterate the procedure until the utility measure falls below a predefined threshold. This produces a set of case frames, each corresponding to a single verb meaning. The proposed method is evaluated experimentally on typical polysemous verbs taken from one year of newspaper articles.
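The covering loop described in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the toy thesaurus, the example instances, the way candidates are generalized (all slots lifted one level at a time), and above all the `utility` function, which is a simple description-length stand-in and not the paper's actual compression-based criterion.

```python
import math

# Hypothetical toy thesaurus: child -> parent ("root" has no parent).
PARENT = {
    "animate": "root", "inanimate": "root",
    "person": "animate", "animal": "animate",
    "food": "inanimate", "tool": "inanimate",
    "student": "person", "teacher": "person",
    "dog": "animal", "cat": "animal",
    "rice": "food", "bread": "food",
    "knife": "tool", "hammer": "tool",
}
CATEGORIES = set(PARENT) | {"root"}
LEAVES = CATEGORIES - set(PARENT.values())


def ancestors(cat):
    """Return the chain [cat, parent, ..., root]."""
    chain = [cat]
    while chain[-1] in PARENT:
        chain.append(PARENT[chain[-1]])
    return chain


def leaves_under(cat):
    """Count the leaf categories dominated by cat."""
    return sum(1 for leaf in LEAVES if cat in ancestors(leaf))


def covers(frame, instance):
    """A frame covers an instance with the same particles whose nouns
    fall under the frame's categories."""
    return set(frame) == set(instance) and all(
        frame[p] in ancestors(instance[p]) for p in frame)


def candidates(instance):
    """Yield increasingly general frames, lifting every slot one level per step."""
    chains = {p: ancestors(n) for p, n in instance.items()}
    depth = max(len(c) for c in chains.values())
    for level in range(depth):
        yield {p: c[min(level, len(c) - 1)] for p, c in chains.items()}


def utility(frame, instances):
    """Assumed stand-in score: bits saved on covered instances minus frame cost."""
    covered = sum(covers(frame, i) for i in instances)
    saved = sum(math.log2(len(LEAVES)) - math.log2(leaves_under(c))
                for c in frame.values())
    frame_cost = len(frame) * math.log2(len(CATEGORIES))
    return covered * saved - frame_cost


def learn_frames(instances, threshold=0.0):
    """Pick the best frame, remove what it covers, repeat until utility < threshold."""
    remaining, frames = list(instances), []
    while remaining:
        best = max((f for i in remaining for f in candidates(i)),
                   key=lambda f: utility(f, remaining))
        if utility(best, remaining) < threshold:
            break
        frames.append(best)
        remaining = [i for i in remaining if not covers(best, i)]
    return frames


# Hypothetical instances of one verb with two senses (particle -> noun).
INSTANCES = [
    {"ga": "student", "wo": "rice"},
    {"ga": "teacher", "wo": "bread"},
    {"ga": "student", "wo": "bread"},
    {"ga": "teacher", "de": "knife"},
    {"ga": "student", "de": "hammer"},
]

for frame in learn_frames(INSTANCES):
    print(frame)
```

In this toy setting the score penalizes both overly specific frames (few instances covered) and overly general ones (little saved per instance), so the loop settles on one mid-level frame per particle pattern.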
Similar resources
Classifying Japanese Polysemous Verbs based on Fuzzy C-means Clustering
This paper presents a method for classifying Japanese polysemous verbs using an algorithm to identify overlapping nodes with more than one cluster. The algorithm is a graph-based unsupervised clustering algorithm, which combines a generalized modularity function, spectral mapping, and fuzzy clustering technique. The modularity function for measuring cluster structure is calculated based on the ...
A Frame-based Approach to Polysemous Near-synonymy: the Case with Mandarin Verbs of Expression
In this paper, we propose a frame-based approach to polysemy by analyzing three near-synonymous verbs biaoshi (表示), biaoda (表達) and biaolu (表露). Based on Liu and Wu (2004), this paper further discusses the cross-frame phenomena of near-synonyms with a detailed comparison of their syntactic and collocational patterns. It is shown that polysemy among related verbs may be well defined and manifest...
Sense Classification of Verbal Polysemy based-on Bilingual Class/Class Association
In the field of statistical analysis of natural language data, the measure of word/class association has proved to be quite useful for discovering a meaningful sense cluster in an arbitrary level of the thesaurus. In this paper, we apply its idea to the sense classification of Japanese verbal polysemy in case frame acquisition from Japanese-English parallel corpora. Measures of bilingual clas...
Verbal Case Frame Acquisition From A Bilingual Corpus: Gradual Knowledge Acquisition
This paper describes acquisition of English surface case frames from a corpus, based on a gradual knowledge acquisition approach. To acquire and unambiguously accumulate precise knowledge, the process is divided into three steps which are assigned to the most appropriate processor: either a human or a computer. The data is prepared by human workers and the knowledge is acquired and accumulated...
Publication date: 2002